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Abstract 

We demonstrate that certain weighted moments of tth production can 
provide a determination of the CP nature of a hght neutral Higgs boson 
produced and detected in this mode at the CERN LHC. 

1 Introduction 

It is well-known |jl|, Q that detection of a Higgs boson h with couplings roughly 
like those of the Standard Model (SM) Higgs (0°) will be possible at the LHC, 
the detection mode depending upon the mass rrih. If < 130 GeV, the primary 
techniques for observing such an h rely on gg — » h, W* Wh, and gg — >• tih, 
with /i ^ 77 and, for the latter two modes, h bb. If the h is part of a larger 
(non-SM) Higgs sector, deviations of the Wh and tih production rates from SM 
expectations will be present, but difficult to interpret. A direct determination of 
whether the h is CP-even (as predicted for the SM 0° and the minimal supersym- 
metric model would be especially crucial to unravelling the situation. The only 
procedure proposed to date for directly determining the CP nature of a light h 
employs proton beam polarization asymmetries that are only sufficiently large if 
proton polarization implies substantial polarization for the colliding gluons in the 
gg ^ h process 0. Here, we demonstrate that certain weighted moments of the 
cross section for tth production are sensitive to the relative magnitudes of the CP- 
even and CP-odd coupling coefficients c and d (respectively) in the tih interaction 
Lagrangian, C = i{c + id'-f^)th. The accuracy with which these moments can be 
measured experimentally at the LHC (assuming an accumulated, detector-summed 
luminosity of L = 600 fb~^) is sufficient that the CP-even SM Higgs boson can be 
shown to be inconsistent with an equal mixture of CP-odd and CP-even compo- 
nents at the ~ 1.5(7 — 2cr statistical level, and inconsistent with a purely CP-odd 
state at the > 7cr statistical level. 
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2 The Technique 



Consider a general quark-antiquark-Higgs coupling of the form Q{c + id'^^)Qh, 
where c and d are both taken to be real; c and d determine the CP-even and CP- 
odd components of the coupling, respectively. By explicit calculation, it can be 
demonstrated that the spin-averaged cross sections for gg —>■ QQh and qq — > QQh 
production contain no terms proportional to cd. However, they do contain terms 
proportional to both {c^ + d'^) and (c^ — d^). Since the (c^ — d^) terms are multiplied 
by m'^j^ sensitivity to {c^ — d^) terms is only significant if is of the same order as 
the other invariant masses- squared for the subprocess. The latter are fairly large, 
being set by the scale m|, implying that only tth production will have substantial 
sensitivity to (c^ — d^). For convenience, we adopt the conventional normalization 
of {(? -|- d^) = 1 in our tabulations below. 

In order to isolate the (c^ — d"^) terms in the production amplitude-squared 
in a manner that is free of systematic uncertainties associated with the overall 
production rate, we compute the ratio: 

_ SPcp] {dajpp ttX)/dPS} dPS 

J{dcx{pp^ttX)/dPS}dPS ' ^' 

where Ocp is an operator designed to maximize sensitivity of a to the (c^ — d'^) 
term. We have found that a variety of simple operators offer substantial sensitivity 
to (c^ — d^). The best of those that we have examined are 

^ _ (ptxn)-(ptxn) ^ _ (ptXra)-(pVxn) ^ _ (ptXn)-{pfXn) 
1 |(ptXn)-(pVxn)| 1 p[pi' 2 \pt\\Pt\ (^\ 

_ pIp^ , _ pfpf , _ pjp? \^) 

where pj^ denote the magnitudes of the t and t transverse momenta. In Eq. 0, n 
is a unit vector in the direction of the beam line and defines the z axis. One can 
also employ the y-component analogues of 02 and 63. Since all these operators will 
have somewhat different systematic uncertainties, it will be useful to analyze the 
CP properties using all of them (and, perhaps, others as well). Higher moments 
of the operators were also considered, but in all cases the linear moments were the 
most sensitive to changes in the c vs. d weighting. 

The critical question with regard to the usefulness of a given a is the accuracy 
with which it can be measured relative to the predicted changes in a as a function 
of the CP nature of the h. In general, there will be background as well as tth signal 
contributions in any ttX channel {X = 77 or bb). We define as and to be the 
value of a as defined in Eq. ^ for the signal and background cross sections on their 
own. We also define 

_ JjOcP? {dajpp ^ ttX)/dPS}dPS 
J{da{pp-^ttX)/dPS}dPS ' 



^This is in addition to the ni^ factor contained in the and squared couphngs themselves. 
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and the f5s and (3b values of (3 for the signal and background individually. Then, 
one finds that the experimental error for 05 is given by 



6aQ = S 



-1/2 



[3s-al + — {^I3b - 2aBas + a 



1I/2 



(4) 



where S and B are the total number of signal and background events, respectively, 
in the ttX channel being considered. This result assumes that B, as and Pb can 
be experimentally measured (using data with Mx not in the vicinity of rrih) and/or 
calculated with high precision. Note that (3 = 1 for the operators. 



Table 1: tt'yy channel ub and Pb values, assuming nih = 100 GeV. 



OcP 




hi 62 


02 


&3 


&4 


aB 


-0.863 


-0.796 -0.249 


-0.698 


-0.404 


0.130 


(3b 


1 


0.806 0.127 


1 


0.332 


0.411 



Table 2: Values of as and (3s for the tth production, assuming nih = 100 GeV, for 
the cases: c = 1, d = 0; c = d = l/V^; c = 0, d = 1. Also given is S^^'^Sas in the 
limit B = 0. 



OcP 


ai 


61 


h2 


a2 


h 


64 


c = l,rf = 














as 


-0.810 


-0.718 


-0.269 


-0.619 


-0.359 


0.292 




1 


0.736 


0.151 


1 


0.308 


0.376 


S^lHas 


0.586 


0.469 


0.280 


0.785 


0.424 


0.539 


c = d= I/V2 














as 


-0.742 


-0.654 


-0.243 


-0.562 


-0.327 


0.228 


Ps 


1 


0.707 


0.139 


1 


0.302 


0.372 


S^lHas 


0.671 


0.528 


0.283 


0.827 


0.442 


0.566 


c = 0,rf = 1 














as 


-0.486 


-0.407 


-0.147 


-0.335 


-0.200 


-0.005 


Ps 

S^'Has 


1 


0.593 


0.096 


1 


0.272 


0.371 


0.874 


0.654 


0.272 


0.942 


0.482 


0.609 



Let us now focus on the X = 77 channel. Detection of a Higgs boson in the tt77 
final state was originally proposed in Ref. 0] and has been thoroughly studied by 
both the ATLAS and CMS collaborations in their Technical Proposals P, ^. The 
final state employed is that in which one t decays semi-leptonically, while the other 
decays hadronically. This allows identification of the t vs. the t, and reconstruction 
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of the transverse momenta of both the t and t. This is aheady sufficient for defining 
ai^2 and 61^3 in Eq. ||. The 62,4 operators of Eq. || require knowledge of the full t 
and t three momenta. For the hadronically decaying t, the three jets can be used 
for this determination. Determination of the z component of momentum of the 
leptonically decaying t is subject to the usual two-fold ambiguity in determining 
the z component of the momentum of the unobserved neutrino using the missing 
transverse energy, mj, = 0, and the known value of m^. The algorithm in which 
is chosen so as to minimize the overall rapidity of the tth system yields the correct 
solution a large fraction of the time. 

We first present the values of a and f3 for signal and background, for the 
various different Ocp operators listed in Eq. H, assuming a Higgs boson mass 
of ruh = 100 GeV. These values include mild cuts on the outgoing t, t and photons 
of: \yt,t,-y\ < 4, p!^ > mh/4:. In actual practice, the detector collaborations must 
compute the expected numbers using their full cuts and resolutions. Results for 
as and Pb are given in Table |I], and results for 0:5 and Ps are given in Table p| 
for several choices of c, d. Also given are the values of S^^'^5as in the limit of zero 
background. These latter values give a first indication of the relative sensitivity of 
the various Ocp operators to different values of c, d. 

Table 3: ^77 channel, rrih = 100 GeV: discrimination powers in the limit B = 0. 



Ocp 


ai hi 62 02 63 &4 




0.117 0.137 0.091 0.073 0.076 0.118 
0.554 0.663 0.436 0.362 0.374 0.551 



In order to better interpret these results, we define the two discrimination 
powers: 

^ ^ \as{c = 1, d = 0) - as{c = d = 1/V2)\ 
^ Sas{c = l,d = 0) ' 

^ ^ \as(,c =l,d = 0) - asjc = 0,d=l)\ 
^ 6as{c = l,d = 0) ' 

(5) 

which indicate the ability of these observables to separate the SM-like case of 
c = 1, d = from the equal mixture d = and from the pure 

CP-odd case of c = 0, d = 1. These definitions of D12 are those appropriate if the 
observed Higgs boson is CP-even (so that it is appropriate to use Sas{c = l,d = 0) 
in computing the experimental error) .g Values for L'l.z/v^ are tabulated in Table |3| 

^If the h is pure CP-odd, D2 should be defined using 6as{c — 0,d = 1); if it is an equal 
CP-even/CP-odd mixture, Di should be defined using 6as{c — d — \/^/2). 
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in the limit where B = (no background). From this table, it is apparent that the 
operators ai, bi, 62 and 64 will be most useful, bi probably being the best, both in 
that it has the largest discrimination power, and also in that it can be constructed 
without use of 2;-component momenta. 

Table 4: tt77 channel, rrih = 100 GeV: the discrimination powers Di^2 for S = 130 
and B = 21. 



OcP 


ai 


bi 


b2 


^2 


^3 


&4 




1.26 


1.47 


0.97 


0.78 


0.81 


1.21 


D2 


5.97 


7.10 


4.67 


3.87 


3.97 


5.65 



Of course, in practice we must include the background in order to obtain the 
true value of 6as, see Eq. ^. Let us focus on the case where the h has SM-like 
couplings and ask how well we can measure as in this case. For this purpose, we 
employ the results from the CMS experimental Technical Proposal, Figure 12.8 (in 
which L = 162.5 fb~^ is assumed), scaled to the current standard benchmark in 
which it is assumed that each detector (ATLAS and CMS) will separately accumu- 
late an integrated luminosity of L = 300 fb~^, corresponding to a total detector- 
summed luminosity of L = 600 fb~^. For our estimates we will sum events in a 
5 GeV interval centered at M^^ = nih = 100 GeV. From the above-referenced Fig. 
12.8, one obtains, after rescaling, S = 259 and B = 42 for the sum of the W'j'j 
and tt77 modes. At the LHC, these two modes contribute almost equally, and so 
we divide these numbers in half to obtain the ^77 mode results of S* = 130 and 
B = 21. To quantify our ability to discriminate the SM case of c = l,d = from 
the c = d = l/\/2 and c = 0, d = 1 cases, we compute Di and D2. The resulting 
numerical values for Di^2 are given in Table ^ and indicate the level of statistical 
significance at which an observed CP-even h could be said to not be (1) an equal 
mixture of CP-odd and CP-even or (2) a purely CP-odd state. These values may 
be somewhat optimistic in that the above rates are those obtained before recon- 
structing the t and t. However, because of the cleanliness of the ^77 final state, 
we do not anticipate a large event rate loss for such reconstruction. 

As anticipated, the operators ai, bi, 62 and 64 provide the best discrimination. 
For the best single operator, bi, discrimination from the c = d = 1/V2 case is 
only achieved at the ~ 1.5cr level. To reach the more satisfactory 3cr level would 
require of order four times as much integrated luminosity. Discrimination between 
the purely CP-even case and the purely CP-odd case would be possible using bi 
at a high level of statistical significance, ~ 7a. Better statistical significance in 
both cases can be achieved to the extent that the different operators are sensitive 
to different aspects of the t and t distributions in the final state. In particular, 
64 is primarily sensitive to the t,t longitudinal momenta distributions, as distinct 
from the transverse momenta distributions that determine 61. Simply combining 
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the statistics for these two different moments would imply that c = 1, d = could 
be distinguished from c = d = 1/a/2 at nearly the 2a level, assuming the L = 
600 fb~ SM-like event rate. In reality, the experimental groups would undoubtedly 
obtain the best level of discrimination by studying the likelihood that a particular 
c, d mixture fits their data in an overall sense (without assuming knowledge of 
normalization). 

A second possible channel for this type of analysis is the X = bb channel {i.e. 
ttbb final state). The signal event rate is much larger than in the ti'j'j final state, 
but there are large backgrounds. This channel for Higgs discovery was first explored 
in Refs. |[, and has been studied by the ATLAS and CMS collaborations with 
generally encouraging results. These analyses isolate a signal by demanding that 
one of the fs decay semi-leptonically, and that three or four 6-quarks be tagged. 
The expected 6-tagging efficiency and purity at high luminosity are sufficiently large 
that statistically significant signals for a SM-like h can be achieved. In Ref . |Q , with 
L = 100 fb~^ and using 3-6-tagging, event rates are large enough that a roughly 5a 
signal is seen for ruh ~ 100 GeV, despite a small S/B ~ 1/40 signal to background 
ratio. The 4-6-tagging results of Ref. [0 yield a cleaner S/B ~ 1/2, but S is also 
reduced so that S/\fB for L = 100 fb~^ is at most increased to S/\fB ~ 8 — 10. 
(This latter study is not a full experimental simulation, and it is quite possible that 
these larger S/\fB values are too optimistic.) Neither of these analyses demands 
that the t-quarks be reconstructed, as would be required in order to construct the 
Ocp operators considered here. Such reconstruction could well improve the S/B 
ratios (in particular, by largely eliminating the combinatoric backgrounds), but 
undoubtedly at further sacrifice of signal event rate. Nonetheless, it seems probable 
that, for the combined ATLAS+CMS benchmark luminosity of L = 600 fb~^, 
S/\fB > 5 can be achieved after full t-quark reconstruction. We assume that after 
such reconstruction the only significant background will be from the irreducible 
tibb background process. (This is clearly the case if 4-6-tagging is employed.) In 
analyzing the experimental error on the determination of as for the different Ocp 
operators, we then need only compute the and Pb values for the irreducible 
tibb background. 



Table 5: ttbb channel as and Pb values, assuming rrih = 100 GeV. 



Ocp 




h 


b2 


a2 


bs 


&4 


CtB 


-0.552 


-0.478 


-0.177 


-0.395 


-0.239 


0.241 


Pb 


1 


0.642 


0.122 


1 


0.284 


0.375 



The values of and Pb for the cuts ly^ji^il < 4 and p^-^ > rrih/A are given 
in Table |^. The corresponding signal reaction as and Ps values for the various 
Ocp operators are unchanged from the results appearing in Table 0. Although 
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the cuts that will be required in the eventual analysis will be far more complex, 
the changes in a^, 13s, as and Pb are unlikely to be so large as to invalidate the 
estimates obtained below. 

Given as, Ps, C(b, and Pb, it is straightforward to compute the signal rate S for 
a SM-like Higgs that would be required to achieve Di = 2 (that is to discriminate a 
pure CP-even coupling from an equal CP-even/ CP-odd mixture at the 2a statistical 
level) as a function of B/S. The results are easily summarized. For B/S ~ 1, 
Di = 2 requires S ~ 700 (~ 600), equivalent to S/^B ~ 26 (~ 24), using 
Ocp = hi (h). For B/S 50, Di = 2 requires S ~ 20, 000 (~ 15, 000), equivalent 
to S/\/B ~ 20 (~ 17), for Ocp = &i (^4). The S values for 64 are a bit smaller 
than for hi, but probably not enough to compensate for inaccuracies associated 
with the need to use longitudinal momenta in constructing the former. Note the 
approximate rule that, for large B/S, S/ a/B ~ 20 is inevitably required to achieve 

~ 2 (using hi). The signal event rate required for D2 = 2, i.e. to distinguish a 
purely CP-even Higgs from a purely CP-odd Higgs at the 2a level, is only about 
1/20 that needed to achieve Di = 2 (for a given B/S). Indeed, ~ 2 is achieved 
whenever we have a S/ \fB ~ 4 — 5 Higgs boson signal! 

The above rules arise because Di^2 scale as S/\fB for large B/S, see Eq. ||. 
Considering both the 77 and hb channels and both Di and D2, we find, at large 
B/S, 

77: 1^1=2^, D, = 2^ 

hh: Di = 2^, D2 = 2'-I^ (6) 

when employing the operator hi (alone). In fact, the S/\/B values required for a 
given D12 value are remarkably independent of B/S once B/S > 2. For example, 
in the tihb channel, if 5/5" ~ 2 then = 2 is achieved for S/ ^J~B ~ 5. Of course, 
these large B/S scaling laws are not likely to be relevant for the ^77 channel, since 
B/S -C 1 is the general rule there. Once B/S <^ 1, D12 scale as v^: in the limit 
of -B = 0, one simply uses Table || to obtain the S required for a given D12. 

Overall, the tihb channel will probably not be as useful as the ^77 channel 
in the case of a SM-like CP-even Higgs boson. However, for a non-SM-like Higgs 
boson, BR{h — ^ 77) might be too small to yield an observable signal in the ^77 
final state, whereas BR{h — > hb) will generally be large and an observable signal 
in the tihb channel will be possible so long as the tth coupling is not significantly 
suppressed compared to SM strength. The most difficult example is the case of 
a purely CP-odd A^. The absence of the W^W~ coupling implies very small 
BR{A^ — i> 77). In a two-Higgs-doublet model of type-II, the ttA^ {hbA^) coupling 
is oc cot P (oc tan/?), where tan/3 is the ratio of the vacuum expectation values 
of the two neutral Higgs doublet fields [Q, 0]. In the preferred tan/? > 1 region 
of parameter space, the tiA^ production cross section, proportional to cot^ /5, is 
suppressed, but BR{A^ hb) is large (for m^o below 2mt). With L = 600 fb~^, 
it might prove possible to achieve S/\fB ~ 5 for tan/? < 2, which (assuming large 
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B/S and employing 5as{c = 0, = 1) for hi from Table |^ in defining D2) would 
imply D2 ~ 1.6. This would at least add support to the indirect evidence for a 
large CP-odd component deriving simply from the absence of observable W^+Higgs 
and tt77 channel signals. 



3 Final Remarks and Conclusions 

In this letter, we have shown that weighted moments of the iih production cross 
section can provide a rough but direct (cross section normalization independent) 
determination of the CP nature of a light Higgs boson at the LHC. In the ^77 final 
state, if we employ the best single weighting operator that does not depend upon 
the longitudinal momenta of the t quarks, then a SM-like Higgs boson (pure CP- 
even) can be distinguished from a Higgs boson that is an equal mixture of CP-odd 
and CP-even at the ~ l.Scr statistical level with about 130 signal events (assuming 
B/S <^ 1). This is roughly the number obtained for a combined ATLAS+CMS 
luminosity of L = 600 fb~ (assuming that the required t-quark reconstruction does 
not cause large losses in this final state). About 240 signal events (after t-quark 
reconstruction) are needed to achieve 2a discrimination. These same numbers of 
events would distinguish the purely CP-even Higgs from a purely CP-odd Higgs at 
the ~ 7a and ~ lOa level, respectively. If the best operator depending upon t-quark 
longitudinal momenta can also be employed, the above statistical significances 
would increase by about 40%. 

In the itbb channel, typically characterized by S/S" > 1 (possibly S> 1), 2a 
discrimination between a pure CP-even Higgs and one with an equal mixture of 
CP-odd and CP-even components requires S/^/B ~ 20, which would in general 
be very difficult to achieve (except for a Higgs with enhanced tiA^ coupling — re- 
quiring tan/5 < 1 in the CP- violating type-II two-Higgs-doublet model). However, 
distinguishing between purely CP-even and purely CP-odd coupling at the > 2a 
level is possible whenever S/ ^/B > 4, that is, whenever the Higgs boson can be 
detected. 

If both the ^77 and ttbb channels yield a useful level of discrimination, statistics 
in the two channels can be combined to further improve the overall discrimination 
level. 

Although our analysis has made use of rather simplified cuts, we do not believe 
that these results depend very much on the cuts. Nonetheless, the experimen- 
tal groups should compute the weightings defined in Eqs. and ^, for both the 
^77 and ttbb channels, using their full simulation, including the necessary t-quark 
reconstruction. 

A more detailed discussion of the implications of these results for a general 



CP- violating two-Higgs-doublet model will appear in a later paper |10 . 
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